A Comparative Analysis of Methods for Probability Estimation Tree
نویسندگان
چکیده
In this paper, we address the problem of probability estimation of decision trees. This problem has received considerable attention in the areas of machine learning and data mining, and techniques to use tree models as probability estimators have been suggested. We make a comparative study of six well-known class probability estimation methods, measured by classification accuracy, AUC and Conditional Log Likelihood (CLL). Comments on the properties of each method are empirically supported. Our experiments on UCI data sets and our liver disease data sets show that the PETs algorithms outperform traditional decision trees and naïve Bayes significantly in classification accuracy, AUC and CLL respectively. Finally, a unifying pseudocode of algorithm is summarized in this paper. Key-Words: Probability estimation tree, Decision trees, Classification, Joint distribution, AUC, Conditional log likelihood
منابع مشابه
Evaluation of estimation methods for parameters of the probability functions in tree diameter distribution modeling
One of the most commonly used statistical models for characterizing the variations of tree diameter at breast height is Weibull distribution. The usual approach for estimating parameters of a statistical model is the maximum likelihood estimation (likelihood method). Usually, this works based on iterative algorithms such as Newton-Raphson. However, the efficiency of the likelihood method is not...
متن کاملBayes Networks and Fault Tree Analysis Application in Reliability Estimation (Case Study: Automatic Water Sprinkler System)
In this study, the application of Bayes networks and fault tree analysis in reliability estimation have been investigated. Fault tree analysis is one of the most widely used methods for estimating reliability. In recent years, a method called "Bayes Network" has been used, which is a dynamic method, and information about the probable failure of the system components will be updated according to...
متن کاملApplication of Fuzzy Fault Tree Analysis on Oil and Gas Offshore Pipelines
Fault Tree Analysis (FTA) as a Probabilistic Risk Assessment (PRA) method is used to identify basic causes leading to an undesired event, to represent logical relation of these basic causes in leading to the event, and finally to calculate the probability of occurrence of this event. To conduct a quantitative FTA, one needs a fault tree along with failure data of the Basic Events (BEs). Someti...
متن کاملApplication of Fuzzy Fault Tree Analysis in Risk Assessment of Ammonia Tank Explosion Scenario
Introduction: Chemical industries often have risks for the environment and communities, due to the use of complex facilities and processes. Also, in the ammonia tanks, the probability of risk of explosion is high, owing to their specific characteristics. The aim of this study is to evaluate the risks of explosion scenario at the ammonia tank in the Kermanshah petrochemical complex Material and...
متن کاملتجزیه و تحلیل ریسک انفجار ایستگاههای تقلیل فشار گاز شهری (TBS) با استفاده از روش شناسایی حالات نقص و تحلیل اثرات آن ((FMEA و تکنیک تجزیه و تحلیل درخت خطا (FTA)
Background and aims: Attention to safety of process plants adjacent residential area, especially in the big cities is extremely importance from the perspective of the safety engineering and risk management. This study was carried out whit the aims of identification of hazard points of explosion in Town Border Station (TBS) and qualitative and quantitative analysis of their occurrence causes and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011